Word Sense Acquisition For Multilingual Text Interpretation
Abstract
We discuss a method for using automated corpus analysis [...] from news stories with broad coverage in Japanese and English. Our approach focuses on tying together word senses, using a combination of world knowledge (ontology) with word knowledge (corpus data). We explain the approach and its results in SHOGUN.
Similar resources
Comparing methods for automatic acquisition of Topic Signatures
The main goal of this work is to compare two methods for building Topic Signatures, which are vectors of weighted words acquired from large corpora. We used two different software tools, ExRetriever and Infomap, to acquire Topic Signatures from a corpus. Using these tools, we retrieve sense examples from large text collections. Both systems construct a query for each word sense using WordNet. ...
Using Parallel Corpora for Word Sense Disambiguation
Word Sense Disambiguation (WSD) is the Natural Language Processing (NLP) task that consists in selecting the correct sense of a polysemous word in a given context. Most state-of-the-art WSD systems are supervised classifiers that are trained on manually sense-tagged corpora, which are very time-consuming and expensive to build. In order to overcome this acquisition bottleneck (sense-tagged corp...
Crossing Parallel Corpora and Multilingual Lexical Databases for WSD
Word Sense Disambiguation (WSD) is the task of selecting the correct sense of a word in a context from a sense repository. Typically, WSD is approached as a supervised classification task to get state-of-the-art performance (e.g. [6]), and thus a large amount of sense-tagged examples for each sense of the word is needed, according to the word-expert approach. This requirement makes the supervis...
Multilingual Evaluation of KnowNet / Evaluación Multilingüe de KnowNet
This paper presents a new fully automatic method for building highly dense and accurate knowledge bases from existing semantic resources. Basically, the method uses a wide-coverage and accurate knowledge-based Word Sense Disambiguation algorithm to assign the most appropriate senses to large sets of topically related words acquired from the web. KnowNet, the resulting knowledge-base which conne...
(Digital) Goodies from the ERC Wishing Well: BabelNet, Babelfy, Video Games with a Purpose and the Wikipedia Bitaxonomy
Multilinguality is a key feature of today’s Web, and it is this feature that we leverage and exploit in our research work at the Sapienza University of Rome’s Linguistic Computing Laboratory, which I am going to overview and showcase in this talk. I will start by presenting BabelNet 2.5 (Navigli and Ponzetto, 2012), available at http://babelnet.org, a very large multilingual encyclopedic dictio...